CDS

Accession Number TCMCG075C07507
gbkey CDS
Protein Id XP_017971596.1
Location join(35961810..35961899,35962005..35962074,35962202..35962299,35963201..35963339,35963476..35963603,35963710..35963802,35963930..35963980,35964906..35964971,35965083..35965148,35965335..35965415,35965514..35965594,35966810..35966977,35967077..35967158,35967267..35967388,35967860..35968222)
Gene LOC18609831
GeneID 18609831
Organism Theobroma cacao

Protein

Length 565aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018116107.1
Definition PREDICTED: putative clathrin assembly protein At2g01600 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category TU
Description Clathrin assembly protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K20043        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005886        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0071944        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGGGACTCTTCAAACATGGAGAAAAGCGTATGGCGCTCTTAAAGATACAACCAAAGTCGGTCTCGCTCATGTCAACAGTGATTACGCGGATTTGGATGTGGCTATAGTTAAAGCTACCAACCATGTTGAGTGTCCTCCCAAAGAAAGGCATCTTCGAAAAATCTTCATGGCCACATCAGCCATTCGGCCTCGAGCAGATGTTGCTTATTGCATTCATGCGCTTGCCCGGCGATTGGCCAAGACTCATAATTGGACGGTTGCCTTGAAAACACTCATAGTTATCCATAGGGCATTGAGGGAGGGTGATCCTACTTTCAGAGAAGAACTTTTAAACTTCTCACAAAGAGCACGTATTCTTCAACTTTCTAATTTCAAAGATGATTCTAGCCCTATTGCATGGGATTGCTCTGCCTGGGTACGTACATATGCATTGTTTTTGGAAGAAAGACTCGAATGCTTTAGGATTCTGAAGTATGACATTGAAGCTGAGCGTCTGCCAAGACCTGCCCAGGGGCAGGATAAGGGTTACAGCAGAACCAGGGAGTTGGACAGTGAAGAATTGTTGGAGCAATTGCCTGCTCTGCAGCAATTGCTCTATCGTCTTATTGGTTGCCGGCCAGAAGGTGCTGCTATAGGCAACTATGTTATACAGTATGCTTTGGCTCTGGTATTGAAGGAGAGCTTCAAAATATATTGTGCTATTAATGATGGAATTATCAATCTTGTCGACAAGTTTTTTGAGATGCCAAGGCATGAAGCTGTCACGGCACTTGATGTATACAAGCGAGCTGGTCAGCAGGCTAATAGCCTTTCTGATTTCTATGATGTTTGCAAAGGATTGGAACTTGCTAGGAACTTCCAGTTTCCTGTTCTCAGGGAGCCACCACAATCCTTTCTCACTACCATGGAAGAGTACATCAGAGAGGCACCACGTGTGGTTTCTGTTCCAACGGAACCATTGCTTCAATTAACATACAGACCTGAGGAAGGTCCCTCTGAAGATACTAAATTATCCAATGATGAACCTGAGCCATCTGCTCTTGCTGATGATATTGCTGTTTCTGGTGTTGAGACTGTTCCGGTTCCCCCTCCTCTACCTCAGAACAATGCGGATGGTGGAGACTTACTGGACTTGAGTTATTCTGCCCCTGATGCCTTGGCAATTGAGGAAAGTAATGCTTTAGCTCTAGCCATAGTTCCTACTGAACCTGGTACTGGTCCAACATTTAATTCTACAACTGGTCAACCAAAAGATTTTGATCCTACTGGATGGGAACTTGCCCTTGTCACCACACCAAGTAGCGATATTTCTGCAGTTAATGATAGGCAATTGGCTGGTGGGTTGGACTCGCTCACTCTCAACAGCTTGTATGATGAAGCAGCATATAGAGCTTCTCAGCAGCCTGTATATGGAGCTCCAGCTCCAAATCCGTTTGAGGTACAAGACCCATTTGCCATGTCAAATAACATTGCTCCCGCTAGAGCAGTTCAAATGGCAGCAATGGCTCAACCGCAAAGCAATCCCTTTGGTCCATACCAACCTACCTATCAGCAGCCACTGCAGCAGCAACATATGATGATGAGCCCATCAAATCCATTTGGTGATGCAGGGTTTGGGGCATTTCCAGTGAACCAAATGCCCCCTGTTGCTCAGCCACATGCTAATAATCCATTTGGAAGCACAGGCCTTTTGTAA
Protein:  
MGTLQTWRKAYGALKDTTKVGLAHVNSDYADLDVAIVKATNHVECPPKERHLRKIFMATSAIRPRADVAYCIHALARRLAKTHNWTVALKTLIVIHRALREGDPTFREELLNFSQRARILQLSNFKDDSSPIAWDCSAWVRTYALFLEERLECFRILKYDIEAERLPRPAQGQDKGYSRTRELDSEELLEQLPALQQLLYRLIGCRPEGAAIGNYVIQYALALVLKESFKIYCAINDGIINLVDKFFEMPRHEAVTALDVYKRAGQQANSLSDFYDVCKGLELARNFQFPVLREPPQSFLTTMEEYIREAPRVVSVPTEPLLQLTYRPEEGPSEDTKLSNDEPEPSALADDIAVSGVETVPVPPPLPQNNADGGDLLDLSYSAPDALAIEESNALALAIVPTEPGTGPTFNSTTGQPKDFDPTGWELALVTTPSSDISAVNDRQLAGGLDSLTLNSLYDEAAYRASQQPVYGAPAPNPFEVQDPFAMSNNIAPARAVQMAAMAQPQSNPFGPYQPTYQQPLQQQHMMMSPSNPFGDAGFGAFPVNQMPPVAQPHANNPFGSTGLL